311
Chapter 7
Whole Genome Pattern Discovery
her a DNA sequence or a protein sequence is the basic
mponent for most genomics research. It is well understood
t a sequence is the main carrier of genetic information for
y species. It is also no doubt that genomics research can
rdly be successful without looking into sequence
nstituents. Importantly, most novel and significant
coveries in biology or medicine are based on sequencing
a analysis nowadays. This chapter will introduce the
mmonly used sequence analysis approaches from the basic
es to the advanced ones and mainly focus on the sequence
mparison approaches for whole genome pattern discovery.
oreover, this chapter will show how the sequence
mparison approaches can be used to analyse the SARS-
V-2 pandemic data.
SARS-CoV-2 pandemic
VID-19 (Coronavirus Disease 2019) pandemic caused by SARS-
evere acute respiratory syndrome coronavirus 2) is still in a huge
ng situation worldwide since it firstly emerged in WuHan, China
mber 2019 [Zhou, et al., 2020]. Till the 30th March 2021, there
n more than 128.6 million infections and more than 2.8 million
orldwide as reported at worldmeter webpage. Until the 4th January,
has collected 315,253 SARS-CoV-2 DNA genome sequences
ost all countries in the world. Based on these huge volumes of
data, there are many questions which are waiting for answers